New Implementations and Results for the NAS Parallel Benchmarks 2
نویسندگان
چکیده
We present new implementations and results for the NAS Parallel Benchmarks 2 suite. The suite currently consists of seven programs. Of these LU, SP, BT, MG and FT have previously been released. Here we describe implementations of EP and IS, as well as a rewritten version of FT that corrects some problems with the original release.
منابع مشابه
OpenSHMEM Performance and Potential: A NPB Experimental Study
OpenSHMEM is a recently introduced open standard for all SHMEM libraries. In this paper we discuss the different aspects of porting the NAS parallel benchmarks from their MPI 1 implementations to those that use the new OpenSHMEM library API. We compare performance and scalability of these unoptimized OpenSHMEM NAS benchmarks with their MPI 1, and in some cases, MPI 2 counter parts. Through the ...
متن کاملThe NAS Parallel Benchmarks 2.0
We describe a set of implementations of the NAS Parallel Benchmarks based on Fortran 77 and the MPI message passing standard. These implementations, which are intended to be run with little or no tuning, approximate the performance a typical user can expect for a portable parallel program on a distributed memory computer. They complement rather than replace the original NAS Parallel Benchmarks....
متن کاملTitanium Performance and Potential: An NPB Experimental Study
Titanium is an explicitly parallel dialect of Java designed for high-performance scientific programming. We present an overview of the language features and demonstrate their use in the context of the NAS Parallel Benchmarks, a standard suite of common scientific kernels. We argue that parallel languages like Titanium provide greater expressive power than conventional approaches, enabling much ...
متن کاملParallelization of NAS Benchmarks for Shared Memory Multiprocessore
This paper presents our experiences of parallelizing the sequential implementation of NAS benchmarks using compiler directives on SGI Origin2000 distributed shared memory (DSM) system. Porting existing applications to new high performance parallel and distributed computing platforms is a challenging task. Ideally, a user develops a sequential version of the application, leaving the task of port...
متن کاملParallelization of NAS Benchmarks for Shared Memory Multiprocessors
This paper presents our experiences of parallelizing the sequential implementation of NAS benchmarks using compiler directives on SGI Origin2000 distributed shared memory (DSM) system. Porting existing applications to new high performance parallel and distributed computing platforms is a challenging task. Ideally, a user develops a sequential version of the application, leaving the task of port...
متن کامل